Abstract

The literature on volatility modelling and option pricing is a large and diverse area 
due to its importance and applications. This paper provides a review of the most signif- 
icant volatility models and option pricing methods, beginning with constant volatility 
models up to stochastic volatility. We also survey less commonly known models e.g. 
hybrid models. We explain various volatility types (e.g. realised and implied volatility) 
and discuss the empirical properties. 

Key words: Option pricing, volatility models, risk neutral valuation, empirical volatil- 
ity. 

1. Introduction and Outline 

This paper provides a review of the most significant volatility models and their 
related option pricing models, where we survey the development from constant up to 
stochastic volatility. We define volatility, the volatility types and study the empirical 
characteristics e.g. leverage effect. We discuss the key attributes of each volatility 
modelling method, explaining how they capture theoretical and empirical characteristics
of implied and realised volatility e.g. time scale variance. We also discuss less
commonly known models.

The study of volatility has become a significant area of research within financial 
mathematics. Firstly, volatility helps us understand price dynamics since it is one of 
the key variables in a stochastic differential equation governing an asset price. Sec- 
ondly, volatility is the only variable in the Black-Scholes option pricing equation that 
is unobservable, hence the ability to model volatility is crucial to option pricing. 

Thirdly, volatility is a crucial factor in a wide range of research areas. For example,
contagion effects involve the "transmission" of volatility from one country to another
[Bau03]. Volatility can also explain extreme events: Blake [Bla90] explains that the
October 1987 crash could have resulted from volatility changes.

Finally, volatility has a wide range of industrial applications, from pricing exotic
derivatives to asset pricing models [Reb04]. Shiller [Shi89] argues the market's volatility
dynamics can be applied to macroeconomic variables, particularly as the stock market
is a well known leading indicator of the economy. Shiller [Shi81] also claims volatility
can be used as a measure of market efficiency.

Option pricing in itself has become an important research area. Research interest
in options pricing began with the Black-Scholes option pricing paper [BS73]; since then
the derivatives market has grown into a multi-trillion dollar industry [Sto99]. Options
have become important to industry, particularly as they can be used to hedge out risk. 
In fact in many situations it is more attractive to speculators and hedgers to trade an 
option rather than an underlying due to the limited loss. Additionally, option trading 
can normally be executed on a far higher level of leverage compared to trading stocks, 
therefore offering potentially higher returns for the same initial deposit. 

The outline of the paper is as follows. Firstly, we review basic financial mathematics 
theory, which is essential for the study of volatility modelling and option pricing. Next, 
we introduce the differing types of volatility and discuss their empirical behaviour e.g. 
leverage effect. We then discuss the key models of volatility and their associated option 
pricing methods. We finally end with a conclusion. 



2. Review of Financial Mathematics Theory 

This section provides a brief exposition of standard financial mathematics theory.
The recommended references on this area are Bjork [Bjo04], Oksendal [Oks03], Neftci
[Nef96] and Delbaen and Schachermayer [DS06].



2.1. Stochastic Calculus 

2.1.1. Stochastic Differential Equations 



From the time of Bachelier's work [Bac00], the most popular method of modelling
asset prices has been using stochastic differential equations. Let $X(t)$ denote a stochastic
process (such as a stock price); we define it as a diffusion if it approximately
follows the stochastic difference equation:

$$X(t + \Delta t) - X(t) = \mu(t, X(t))\,\Delta t + \sigma(t, X(t))\,\Delta W(t), \tag{1}$$

where:

• $\Delta W(t) = W(t + \Delta t) - W(t)$;

• $\mu(t, X(t))$ denotes the drift;

• $\sigma(t, X(t))$ denotes the volatility (also known as the diffusion term).

The increment $\Delta W(t)$ is from $W$, which is called a Wiener process.

Definition 1. A stochastic process $W(t)$ is called a Wiener process if it satisfies the
following conditions:

1. $W(0) = 0$;

2. $W(t)$ has independent increments. In other words $W(u) - W(t)$ and $W(s) - W(r)$
are independent for $r < s \le t < u$;

3. $W(t)$ has continuous trajectories;

4. $W(t) - W(s) \sim N(0, \sqrt{t - s})$ for $s < t$, that is, the increment is normally
distributed with mean zero and standard deviation $\sqrt{t - s}$.

The stochastic process is called a diffusion because the equation models diffusions in
physics. For continuous time modelling we let $\Delta t \to 0$ and so the stochastic difference
equation becomes:

$$dX(t) = \mu(t, X(t))\,dt + \sigma(t, X(t))\,dW, \tag{2}$$
$$X(0) = a, \tag{3}$$

where $a$ is a constant. Alternatively, in integral form the equation becomes:

$$X(t) = a + \int_0^t \mu(s, X(s))\,ds + \int_0^t \sigma(s, X(s))\,dW. \tag{4}$$

It is worth noting in equation (4) that the first integral is a standard Riemann integral
whereas the second integral is a stochastic integral, that is, an integral with respect to
a Wiener process.
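
As an illustration of how the stochastic difference equation (1) approximates the SDE (2), the following sketch simulates one discretised path using the Euler-Maruyama scheme. The function name `euler_maruyama`, the number of steps and the example coefficients (drift 0.05x, diffusion 0.2x) are illustrative assumptions and do not come from the text.

```python
import math
import random


def euler_maruyama(mu, sigma, x0, T, n_steps, rng):
    """Simulate one path of dX = mu(t, X) dt + sigma(t, X) dW on [0, T].

    mu and sigma are callables (t, x) -> float; rng is a random.Random.
    Returns the list [X(0), X(dt), ..., X(T)].
    """
    dt = T / n_steps
    t, x = 0.0, x0
    path = [x0]
    for _ in range(n_steps):
        # Wiener increment: normal with mean 0 and standard deviation sqrt(dt)
        dW = rng.gauss(0.0, math.sqrt(dt))
        # One step of the difference equation (1)
        x = x + mu(t, x) * dt + sigma(t, x) * dW
        t += dt
        path.append(x)
    return path


# Hypothetical example: 252 daily steps over one year
rng = random.Random(42)
path = euler_maruyama(lambda t, x: 0.05 * x, lambda t, x: 0.2 * x,
                      100.0, 1.0, 252, rng)
print(len(path), path[0])
```

Passing the generator explicitly keeps the simulation reproducible; a smaller step size brings the discrete path closer to the continuous-time limit.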

2.1.2. Stochastic Integrals and Ito's Lemma 

To integrate a stochastic process $X(t)$ we must know the conditions that guarantee
the existence of the stochastic integral. We can guarantee the integral's existence if
the process $X(t)$ belongs to the class $\mathcal{L}^2$. To define the class we must introduce the
idea of filtration.

Definition 2. Let $\mathcal{F}_t$ denote the set of information that is available to the observer.
If

$$\mathcal{F}_s \subseteq \mathcal{F}_t \subseteq \mathcal{F}_T, \quad \forall s, t \text{ with } s < t < T, \tag{5}$$

then the set $\{\mathcal{F}_t\}$, $t \in [0, T]$, is called a filtration.
For a given stochastic process $X(t)$, increasingly more information is revealed to
an observer as time progresses. Hence at time $t = a$ (where $a$ is a constant) some
information is revealed and this information is known with certainty at any future time
$t > a$. To keep track of the information flow that is revealed at time $t$ we introduce
the filtration $\mathcal{F}_t^X$.

Definition 3. The notation $\mathcal{F}_t^X$ denotes the information generated by process $X(t)$
on the interval $[0, t]$. If based upon the observations of the trajectory of $X(s)$ over the
interval $s \in [0, t]$ it is possible to determine if event $A$ has occurred, then we say $A$ is
$\mathcal{F}_t^X$-measurable and write $A \in \mathcal{F}_t^X$. If the stochastic process $Z$ can be determined based
upon the trajectory of $X(s)$ over the interval $s \in [0, t]$ and $Z(t) \in \mathcal{F}_t^X$, $\forall t \ge 0$, then we
say $Z$ is adapted to the filtration $\{\mathcal{F}_t^X\}_{t \ge 0}$.

We now define the class $\mathcal{L}^2$.

Definition 4. A stochastic process $X(s)$ belongs to the class $\mathcal{L}^2[a, b]$ if the following
conditions are satisfied:

• $X(s)$ is adapted to the filtration;

• $\int_a^b E[X^2(s)]\,ds < \infty$.

If we wish to obtain stochastic differential equations we may require the stochastic 
version of the classical chain rule of differentiation. This is known as Ito's lemma. 

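Theorem 1. (Ito's Lemma) Assume that $X(t)$ is a stochastic process with the stochastic
differential given by

$$dX(t) = \mu(t)\,dt + \sigma(t)\,dW, \tag{6}$$

where $\mu$ and $\sigma$ are adapted processes. Let $Z(t)$ be a new process defined by $Z(t) =
f(X, t)$, where $f$ is a twice differentiable function; then $Z$ has the stochastic differential:

$$df(X(t), t) = \left( \frac{\partial f}{\partial t} + \mu \frac{\partial f}{\partial x} + \frac{1}{2}\sigma^2 \frac{\partial^2 f}{\partial x^2} \right) dt + \sigma \frac{\partial f}{\partial x}\,dW. \tag{7}$$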

2.1.3. Geometric Brownian Motion 

To model asset prices it is tempting to apply Bachelier's model. On studying the
empirical behaviour of stock prices over time, Bachelier in his doctoral thesis [Bac00]
first suggested modelling asset prices $X(t)$ with the following equation:

$$dX(t) = \mu X\,dt + \sigma\,dW, \tag{8}$$
$$X(0) = a, \tag{9}$$

where $a$, $\mu$ and $\sigma$ are constants. However, Bachelier's equation was not satisfactory:
theoretically stock prices are always non-negative, yet Bachelier's equation allowed negative
stock prices. We would also expect percentage returns to be independent of the stock price
$X(t)$, yet under Bachelier's equation they are not. Samuelson in 1965 [Sam65] introduced the
Geometric Brownian motion (GBM) of stock prices, which is the standard model for stock
prices to date:



$$dX/X = \mu\,dt + \sigma\,dW, \tag{10}$$
$$X(t) = X(0) \exp\left( \left( \mu - \tfrac{1}{2}\sigma^2 \right) t + \sigma W(t) \right). \tag{11}$$

In equation (10) $X(t)$ is nonnegative with probability one and if $X(t_1) = 0$ then

$$X(t) = 0, \quad \forall t > t_1. \tag{12}$$

This property reflects financial "bankruptcy" since once $X(t)$ equals 0 it will remain
at 0 permanently thereafter.
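
The closed form solution of GBM can be sampled directly, which gives a quick numerical check of two properties: sampled prices stay strictly positive, and the sample mean is close to the known GBM expectation $X(0)e^{\mu t}$. The sketch below is illustrative only; the function name `gbm_terminal` and the parameter values are assumptions.

```python
import math
import random


def gbm_terminal(x0, mu, sigma, t, rng):
    """Draw one sample of X(t) from the closed form GBM solution."""
    # W(t) is normal with mean 0 and standard deviation sqrt(t)
    w = rng.gauss(0.0, math.sqrt(t))
    return x0 * math.exp((mu - 0.5 * sigma ** 2) * t + sigma * w)


# Hypothetical parameters: X(0) = 100, mu = 5%, sigma = 20%, t = 1 year
rng = random.Random(0)
samples = [gbm_terminal(100.0, 0.05, 0.2, 1.0, rng) for _ in range(100_000)]
sample_mean = sum(samples) / len(samples)

print(min(samples) > 0.0)                     # GBM prices never reach or cross zero
print(sample_mean, 100.0 * math.exp(0.05))    # sample mean vs X(0) * exp(mu * t)
```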

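If we have a portfolio of $n$ assets $X(t) = (X_1(t), X_2(t), \ldots, X_n(t))$, governed by
$n$ independent Wiener processes $W = (W_1(t), W_2(t), \ldots, W_n(t))$, we have the model
[Eth02]:

$$dX_i/X_i = \mu_i\,dt + \sum_{j=1}^{n} \sigma_{ij}\,dW_j, \quad i = 1, \ldots, n, \tag{13}$$
$$X_i(t) = X_i(0) \exp\left( \left( \mu_i - \tfrac{1}{2} \sum_{j=1}^{n} \sigma_{ij}^2 \right) t + \sum_{j=1}^{n} \sigma_{ij} W_j(t) \right), \tag{14}$$

where

• $\sigma_{ij}$ are the entries of the volatility matrix

$$\sigma = (\sigma_{ij})_{i,j = 1, \ldots, n}; \tag{15}$$

• the volatility of each stock $X_i(t)$ is $\sigma_i(t)$, where

$$\sigma_i(t) = \sqrt{\sum_{j=1}^{n} \sigma_{ij}^2(t)}; \tag{16}$$

• the instantaneous correlation between $W_i$ and $W_k$ is given by

$$\mathrm{corr}(dW_i, dW_k) = \rho_{ik}(t)\,dt, \tag{17}$$
$$\rho_{ik}(t) = \frac{1}{\sigma_i(t)\sigma_k(t)} \sum_{j=1}^{n} \sigma_{ij}(t)\sigma_{kj}(t). \tag{18}$$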
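
To make the roles of the volatility matrix concrete, the sketch below computes each stock's volatility and the instantaneous correlations from a constant volatility matrix. It assumes the standard relations: $\sigma_i$ is the Euclidean norm of row $i$ of the matrix, and $\rho_{ik}$ is the normalised inner product of rows $i$ and $k$. The function names and the 2x2 example matrix are hypothetical.

```python
import math


def asset_vols(sigma):
    """Volatility of each stock: the Euclidean norm of its row of the matrix."""
    return [math.sqrt(sum(s * s for s in row)) for row in sigma]


def instantaneous_corr(sigma):
    """Correlation matrix rho[i][k] = (row_i . row_k) / (sigma_i * sigma_k)."""
    vols = asset_vols(sigma)
    n = len(sigma)
    return [
        [sum(a * b for a, b in zip(sigma[i], sigma[k])) / (vols[i] * vols[k])
         for k in range(n)]
        for i in range(n)
    ]


# Hypothetical 2x2 volatility matrix (rows = stocks, columns = Wiener processes)
sigma = [[0.20, 0.00],
         [0.12, 0.16]]
vols = asset_vols(sigma)
rho = instantaneous_corr(sigma)
print(vols)   # both stocks have volatility close to 0.2
print(rho)    # off-diagonal entries close to 0.6
```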
2.2. Black-Scholes Option Pricing

The Black-Scholes analysis of European options [BS73] yields a closed form solution
to option pricing, only requiring observable variables (except for volatility). The key
insight of their analysis was to construct a dynamically replicating portfolio of a
European option to value it. Options have become one of the most important and
frequently traded derivatives in finance. We give a definition of a derivative security
as follows [Bin04].



Definition 5. A derivative security (also known as a contingent claim) is a financial 
contract whose value at expiration time T is precisely determined by the price of an 
underlying asset at time T. 

As the name implies, an option gives the right (but not the obligation) to buy or sell 
an asset at a pre-determined price and time. A call option gives the right to buy the 
asset whereas a put gives the right to sell the asset at a predetermined price. One can 
purchase European options, which are options that can only be exercised at expiration 
T, or American options, which can be exercised any time during the life of the option. 

The Black-Scholes portfolio replication argument for European call options begins
by considering a market consisting of two assets, a riskless bond $B(t)$ and a stock $X(t)$,
with equations respectively:

$$dB(t) = rB(t)\,dt, \tag{19}$$
$$dX(t)/X(t) = \mu\,dt + \sigma\,dW, \tag{20}$$

where $\sigma$ and $\mu$ are constants and $r$ is the risk free rate of return. Black and Scholes managed
to determine the closed form value of a European call option $C$ on the assumption
of no arbitrage (to be defined in section 2.4):

$$C(X(t), t, T, r, \sigma, K) = X(t)\Phi(d_1) - Ke^{-r(T-t)}\Phi(d_2), \tag{21}$$

where

$$d_1 = \frac{\ln(X(t)/K) + (r + \frac{1}{2}\sigma^2)(T - t)}{\sigma\sqrt{T - t}}, \tag{22}$$
$$d_2 = \frac{\ln(X(t)/K) + (r - \frac{1}{2}\sigma^2)(T - t)}{\sigma\sqrt{T - t}} \tag{23}$$
$$= d_1 - \sigma\sqrt{T - t}. \tag{24}$$

In $C(X(t), t, T, r, \sigma, K)$, $t$ is the time at which $C$ is being priced, $T$ is the expiration date,
$\Phi(\cdot)$ is the standard normal cumulative distribution function and $K$ is the strike price.
For a more detailed discussion of the Black-Scholes option pricing equation the reader
is referred to [Duf01].
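
The call formula (21) translates directly into code. The following is a minimal sketch using only the Python standard library; the function names `norm_cdf` and `bs_call` and the example inputs are assumptions chosen for illustration, and `tau` stands for the time to expiration $T - t$.

```python
import math


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def bs_call(x, k, r, sigma, tau):
    """Black-Scholes European call price; tau = T - t must be positive."""
    d1 = (math.log(x / k) + (r + 0.5 * sigma ** 2) * tau) / (sigma * math.sqrt(tau))
    d2 = d1 - sigma * math.sqrt(tau)
    return x * norm_cdf(d1) - k * math.exp(-r * tau) * norm_cdf(d2)


# Hypothetical at-the-money example: X = K = 100, r = 5%, sigma = 20%, one year
price = bs_call(100.0, 100.0, 0.05, 0.2, 1.0)
print(round(price, 4))  # approximately 10.4506
```

Note that the drift $\mu$ is not an argument of the function, reflecting the fact that it does not appear in the pricing formula.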
2.3. Risk Neutral Valuation

2.3.1. Martingales

In financial mathematics, martingale processes are important to understanding key
concepts, such as the idea of completeness. We therefore define martingales now.

Definition 6. An $\mathcal{F}_t$-adapted process $M_t$ is an $\mathcal{F}_t$-martingale if

• $E[|M_t|] < \infty$, $\forall t \ge 0$;

• $E[M_s \mid \mathcal{F}_t] = M_t$, for $s > t$.

If we replace the equality with $\le$ (or $\ge$) then $M_t$ is a supermartingale (or submartingale
respectively).

Whereas a martingale can be considered a fair game, submartingales and supermartingales
can be considered as favourable and unfavourable games, respectively, from
the gambler's perspective. A trivial example of a martingale is a constant $k$; let
$X(t) = k$ then:

$$E[X(t + s) \mid \mathcal{F}_t] = X(t) = k. \tag{25}$$

However in financial mathematics we are interested in stochastic processes since we
wish to model asset prices. We therefore define martingales for stochastic processes.

Proposition 1. A stochastic process $X$ is a martingale if and only if the stochastic
differential is of the form

$$dX(t) = f(t)\,dW, \tag{26}$$

where $f$ is any process satisfying the condition:

$$\int_0^t E[f^2(s)]\,ds < \infty. \tag{27}$$

Therefore for stochastic processes to be martingales they must be "driftless".

2.3.2. Girsanov's Theorem: Change of Measure 

In addition to the Black-Scholes equation (equation (21)), there exists another
method for valuing European options, known as risk neutral valuation. Samuelson
showed that the price of a call option is equal to its discounted expected payoff:

$$C(X(t), K, t, T) = e^{-\hat{r}(T-t)} E^{P}[X(T) - K]^{+}, \tag{28}$$

where $P$ is the probability measure and $\hat{r}$ is the discount factor for risk. The difficulty
with this approach is that the discount factor $\hat{r}$ and the probability measure $P$ vary
according to the risk preference of an investor, so they were unknown and typically
arbitrarily chosen. For instance a risk neutral (or risk indifferent) investor does not require
an incentive or disincentive to take on a risky investment; therefore if the discounted
expected value remains constant he will pay the same price for an investment regardless
of the risk. Furthermore, for a risk neutral investor $\hat{r}$ equals the risk free rate of
return (which is observable); however, $P$ was unknown at the time Samuelson proposed
his option formula.

Cox et al. in [CRR79] realised that under a Black-Scholes model, option pricing is
independent of risk preferences. The investor's risk aversion increases with the stock's
drift $\mu$; yet $\mu$ does not appear in the Black-Scholes option pricing equation (equation
(21)). By using equation (28), it was shown that the same option price is obtained
regardless of the risk preference chosen, provided $\hat{r}$ and $P$ are chosen
consistently. Therefore equation (28) and the Black-Scholes formula would always give
the same answer.

To value options as a risk neutral investor, we discount at the risk free rate and
take expectations under the risk neutral measure. To take expectations under the risk
neutral measure we need to change $P$ in the stochastic differential equation, which
requires Girsanov's Theorem. Girsanov's Theorem tells us how a stochastic differential
equation (SDE) changes as the probability measure $P$ changes. Essentially, Girsanov's
Theorem tells us a change in $P$ corresponds to a change in drift $\mu$ and the rest of the
SDE remains unchanged. To explain how we change probability measures, let us define
probability spaces.

Definition 7. The triple $\{\Omega, \mathcal{F}, P\}$ is called a probability space where $\Omega$ denotes the
sample space, the set of all possible outcomes, $\mathcal{F}$ denotes a collection of subsets of $\Omega$ or
events and $P$ is the probability measure on $\mathcal{F}$ or events. The quadruple
$\{\Omega, \mathcal{F}, \{\mathcal{F}_t\}_{t \ge 0}, P\}$ is called a filtered probability space.

Assume we have the probability space $\{\Omega, \mathcal{F}, P\}$; then a change of measure from $P$
to $Q$ means we have probability space $\{\Omega, \mathcal{F}, Q\}$. We now state Girsanov's Theorem
for change of probability measures for SDEs.

Definition 8. (Girsanov's Theorem) Suppose we have a family of information sets $\mathcal{F}_t$
over a period $[0, T]$ where $T < \infty$. Define over $[0, T]$ the random process (known as the
Doleans exponential)

$$\xi_t = \exp\left\{ -\int_0^t \lambda(u)\,dW^P(u) - \frac{1}{2} \int_0^t \lambda^2(u)\,du \right\}, \tag{29}$$

where $W^P(t)$ is the Wiener process under probability measure $P$ and $\lambda(t)$ is an
$\mathcal{F}_t$-measurable process that satisfies the Novikov condition

$$E^P\left[ \exp\left\{ \frac{1}{2} \int_0^t \lambda^2(u)\,du \right\} \right] < \infty, \quad t \in [0, T].$$

Then $W^Q$ is a Wiener process with respect to $\mathcal{F}_t$ under probability measure $Q$, where $W^Q$
is defined by

$$W^Q(t) = W^P(t) + \int_0^t \lambda(u)\,du, \quad t \in [0, T]. \tag{30}$$

The probability measure $Q$ is defined by

$$Q[A] = \int_A \xi_t\,dP, \quad A \in \mathcal{F}_t, \tag{31}$$

with $A$ being an event in $\mathcal{F}_t$.

Heuristically, Girsanov's Theorem tells us that a change of measure is a subtraction
or addition of an $\mathcal{F}_t$-adapted drift in the Wiener process [BR96]:

$$dW^Q = dW^P + \lambda(t)\,dt. \tag{32}$$

The discounted stock price process can be expressed as

$$dX/X = (\mu - r)\,dt + \sigma\,dW^P, \tag{33}$$
$$= (\mu - r)\,dt + \sigma(dW^Q - \lambda(t)\,dt). \tag{34}$$

If we choose

$$\lambda(t) = (\mu - r)/\sigma, \tag{35}$$

we obtain equation

$$dX/X = \sigma\,dW^Q, \tag{36}$$

which is a martingale (see section 2.3.1) since $\lambda(t)$ cancels the drift. This choice of
$\lambda(t)$ is also known as the market price of risk and the Sharpe ratio (discovered by
Sharpe [Sha66]). Note that only one choice of $\lambda(t)$ in this SDE gives a martingale (or
alternatively eliminates the drift); hence $Q$ is known as the unique equivalent martingale
measure. The equivalent martingale measure is also known as the risk neutral measure
and pricing under this measure is known as risk neutral valuation.
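
Risk neutral valuation can be sketched numerically: simulate the stock under the risk neutral measure (where, under GBM, the undiscounted stock drifts at the risk free rate $r$), average the payoff and discount at $r$. The result should be close to the Black-Scholes closed form. This is a minimal Monte Carlo sketch; the function names, the seed, the number of paths and the parameter values are all illustrative assumptions.

```python
import math
import random


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def bs_call(x, k, r, sigma, tau):
    """Black-Scholes European call price, used here as a benchmark."""
    d1 = (math.log(x / k) + (r + 0.5 * sigma ** 2) * tau) / (sigma * math.sqrt(tau))
    d2 = d1 - sigma * math.sqrt(tau)
    return x * norm_cdf(d1) - k * math.exp(-r * tau) * norm_cdf(d2)


def mc_call(x0, k, r, sigma, tau, n_paths, rng):
    """Discounted average of (X(T) - K)^+ with X simulated under Q."""
    total = 0.0
    for _ in range(n_paths):
        w = rng.gauss(0.0, math.sqrt(tau))
        # Under Q the drift mu is replaced by r; mu itself plays no role
        x_T = x0 * math.exp((r - 0.5 * sigma ** 2) * tau + sigma * w)
        total += max(x_T - k, 0.0)
    return math.exp(-r * tau) * total / n_paths


mc_price = mc_call(100.0, 100.0, 0.05, 0.2, 1.0, 200_000, random.Random(7))
closed_form = bs_call(100.0, 100.0, 0.05, 0.2, 1.0)
print(mc_price, closed_form)  # the two values should agree to within sampling error
```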

2.3.3. Multidimensional or Multifactor Girsanov's Theorem

To change measures for multidimensional SDEs we require the multidimensional 
Girsanov's Theorem, which is very similar to the one dimensional version. 

Definition 9. (Multidimensional Girsanov's Theorem) Suppose we have a family of
information sets $\mathcal{F}_t$. Let $\theta(t) = (\lambda_1(t), \lambda_2(t), \ldots, \lambda_n(t))$ be an $n$-dimensional process that
is $\mathcal{F}_t$-measurable and satisfies the Novikov condition:

$$E^P\left[ \exp\left\{ \frac{1}{2} \int_0^t \sum_{i=1}^{n} \lambda_i^2(u)\,du \right\} \right] < \infty. \tag{37}$$

We define the random process $\xi_t$:

$$\xi_t = \exp\left\{ \sum_{i=1}^{n} \left( -\int_0^t \lambda_i(u)\,dW_i^P(u) - \frac{1}{2} \int_0^t \lambda_i^2(u)\,du \right) \right\}, \tag{38}$$

where $W_i^P$ for $i = 1, \ldots, n$ is an $n$-dimensional Wiener process under probability measure
$P$. Then under the measure $Q$, $W^Q$ is a multidimensional Wiener process defined by:

$$W_i^Q(t) = W_i^P(t) + \int_0^t \lambda_i(u)\,du, \quad \text{for } i = 1, 2, \ldots, n. \tag{39}$$

The probability measure $Q$ is defined by

$$Q[A] = \int_A \xi_t\,dP, \quad A \in \mathcal{F}_t. \tag{40}$$

We wish to obtain an equivalent martingale measure $Q$ related to the vector valued
Wiener process $W = (W_1, W_2, \ldots, W_n)$. Therefore each stock in the multiple stock
model must be a martingale with respect to $Q$. Since changing measure means changing
drift we have a vector $\theta(t) = (\lambda_1(t), \ldots, \lambda_n(t))$ to give an $n$-dimensional $Q$-Wiener
process

$$W_i^Q = W_i^P + \int_0^t \lambda_i(s)\,ds. \tag{41}$$

In the one dimensional case we obtained a martingale by setting $\lambda = (\mu - r)/\sigma$; in the
multiple stock model we apply a similar argument but in every dimension:

$$\sum_{j=1}^{n} \sigma_{ij} \lambda_j(t) = \mu_i - r, \quad \text{for } i = 1, \ldots, n. \tag{42}$$

When the volatility matrix is invertible then unique $\lambda_j$ solutions exist, admitting a
unique risk neutral process. Therefore the discounted risk neutral process for the
multiple stock model is:

$$dX_i/X_i = \sum_{j=1}^{n} \sigma_{ij}\,dW_j^Q. \tag{43}$$
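
As a small worked example of why invertibility of the volatility matrix matters, the sketch below solves for the market prices of risk in a two stock model. It assumes the standard market price of risk equations, in which the excess drift of each stock equals the volatility matrix row applied to the vector of $\lambda_j$. The function name and all numerical inputs are hypothetical, and Cramer's rule is used only because the example is 2x2.

```python
def market_prices_of_risk_2x2(sigma, mu, r):
    """Solve sigma @ lam = mu - r for a 2x2 volatility matrix via Cramer's rule."""
    (s11, s12), (s21, s22) = sigma
    det = s11 * s22 - s12 * s21
    if det == 0.0:
        # A singular volatility matrix gives no unique risk neutral measure
        raise ValueError("volatility matrix is singular")
    b1, b2 = mu[0] - r, mu[1] - r   # excess drifts
    lam1 = (b1 * s22 - s12 * b2) / det
    lam2 = (s11 * b2 - s21 * b1) / det
    return lam1, lam2


# Hypothetical inputs
sigma = [[0.20, 0.00],
         [0.12, 0.16]]
mu = [0.08, 0.10]
r = 0.03
lam1, lam2 = market_prices_of_risk_2x2(sigma, mu, r)
print(lam1, lam2)  # both close to 0.25 for these inputs
```

Substituting the solution back, each stock's drift under $Q$ reduces to zero once discounted, which is the martingale property required in every dimension.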

2.4. Fundamental Theorems of Finance

Before stating the fundamental theorems of finance, we must define arbitrage and 
a self-financing portfolio. A portfolio $V(t)$ is self-financing if the change in value of
the portfolio is a result of the assets' changing values and no external withdrawals or
additions to the portfolio are made. For instance a portfolio $V(t)$ consisting of $k_1$ shares
of $X(t)$ and $k_2$ bonds of $B(t)$ is a self-financing portfolio if:

$$dV(t) = k_1\,dX(t) + k_2\,dB(t). \tag{44}$$

We now define arbitrage. 
Definition 10. An arbitrage possibility in a financial market is a self-financed portfolio
$V(t)$ such that:

• $V(0) \le 0$;

• $V(T) \ge 0$ almost surely and

• $E[V(T)] > 0$.

In words arbitrage is an event where it is possible to make a profit without the
possibility of incurring a loss. We say we have a "fair price" when an asset is free
from arbitrage. We are now in a position to state the two key theorems of financial
mathematics.

Theorem 2. (First Fundamental Theorem of Finance) The market is arbitrage free if 
and only if there exists an equivalent martingale measure. 

Theorem 3. (Second Fundamental Theorem of Finance) Assume that the market is 
arbitrage free. The market is then complete if and only if there exists a unique equivalent 
martingale measure. 

In a no arbitrage market that is incomplete we have a variety of no arbitrage option 
prices since we have a variety of martingale measures. Each martingale measure will 
produce a different price, in general. 

Completeness and arbitrage are separate properties and we can illustrate this following
the heuristic argument of Bjork [Bjo04]. Let $\varpi_2$ denote the number of independent
sources of randomness and let $\varpi_1$ denote the number of tradable assets. Now the
market is arbitrage free if $\varpi_1 \le \varpi_2$ and complete if $\varpi_1 \ge \varpi_2$. Hence the market is
complete and arbitrage free if $\varpi_1 = \varpi_2$.

For no arbitrage we must have $\varpi_1 \le \varpi_2$ since each new tradable asset gives an
opportunity to create an arbitrage portfolio. For example, let $\varpi_2 = 1$ and $\varpi_1 = 2$, where
both assets follow identical GBMs (including the same Wiener process since
$\varpi_2 = 1$) but have different drifts:

$$dX_i/X_i = \mu_i\,dt + \sigma\,dW. \tag{45}$$

Then one can produce a riskless profit by shorting the lower drift asset $X_1$ and using
the proceeds to purchase the higher drift asset $X_2$. For completeness we must have
$\varpi_1 \ge \varpi_2$ to enable us to trade or replicate every possible claim.

The no arbitrage definition enables us to state the put-call parity relationship for 
European options, which does not require any assumptions other than the market is 
arbitrage free. 
Proposition 2. (Put-Call Parity)

Assume the market is arbitrage free. If a European call $C(t, X)$ and a European put
$P(t, X)$ with the same underlying asset $X$, strike $K$ and expiration $T$ exist then we have
the put-call parity relation:

$$P(t, X) = Ke^{-r(T-t)} + C(t, X) - X(t) + D, \tag{46}$$

where $D$ is the cash dividend received from the underlying stock during the life of the
option.
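
Put-call parity can be checked numerically in the Black-Scholes setting with no dividends ($D = 0$): the put obtained from the call through the parity relation should equal the closed form Black-Scholes put, $Ke^{-r(T-t)}\Phi(-d_2) - X(t)\Phi(-d_1)$. The sketch below is illustrative; the function names and inputs are assumptions, and `tau` stands for $T - t$.

```python
import math


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def d1_d2(x, k, r, sigma, tau):
    """The Black-Scholes d1 and d2 terms."""
    d1 = (math.log(x / k) + (r + 0.5 * sigma ** 2) * tau) / (sigma * math.sqrt(tau))
    return d1, d1 - sigma * math.sqrt(tau)


def bs_call(x, k, r, sigma, tau):
    d1, d2 = d1_d2(x, k, r, sigma, tau)
    return x * norm_cdf(d1) - k * math.exp(-r * tau) * norm_cdf(d2)


def bs_put(x, k, r, sigma, tau):
    d1, d2 = d1_d2(x, k, r, sigma, tau)
    return k * math.exp(-r * tau) * norm_cdf(-d2) - x * norm_cdf(-d1)


def put_from_parity(call, x, k, r, tau, dividend=0.0):
    """Put price implied by the parity relation: K e^{-r tau} + C - X + D."""
    return k * math.exp(-r * tau) + call - x + dividend


# Hypothetical inputs
x, k, r, sigma, tau = 100.0, 100.0, 0.05, 0.2, 1.0
parity_put = put_from_parity(bs_call(x, k, r, sigma, tau), x, k, r, tau)
direct_put = bs_put(x, k, r, sigma, tau)
print(parity_put, direct_put)  # both approximately 5.5735
```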

2.5. Feynman-Kac Stochastic Representation Formula 

The Feynman-Kac theorem provides a link between the partial differential equation
of a diffusion process and its expectation. This is useful because we can solve important
analytical problems either by taking expectations under a risk neutral measure or by
solving the partial differential equation.

Theorem 4. (Feynman-Kac Theorem)
Let $X(t)$ satisfy the equation

$$dX(t) = \mu(t, X(t))\,dt + \sigma(t, X(t))\,dW, \tag{47}$$

with initial value at initial time $s$

$$X(s) = x, \tag{48}$$

and let $f(s, x) \in C^2$ be a function which satisfies

$$\int_s^T E\left[ \left( \sigma(t, X(t)) \frac{\partial f}{\partial x}(t, X(t)) \right)^2 \right] dt < \infty \tag{49}$$

and boundary condition

$$f(T, x) = h(x). \tag{50}$$

If the function $f(s, x)$ is a solution to the boundary value problem

$$\frac{\partial f}{\partial s} + \frac{1}{2}\sigma^2(s, x) \frac{\partial^2 f}{\partial x^2} + \mu(s, x) \frac{\partial f}{\partial x} - rf(s, x) = 0, \tag{51}$$

then $f$ has the representation:

$$f(s, x) = e^{-r(T-s)} E[h(X(T)) \mid X(s) = x]. \tag{52}$$
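
The link stated by the theorem can be illustrated with the Black-Scholes call. Taking risk neutral GBM coefficients $\mu(s, x) = rx$ and $\sigma(s, x) = \sigma x$ with payoff $h(x) = (x - K)^{+}$ (these choices are assumptions made for the example), the discounted expectation is the Black-Scholes price, so that price should satisfy the boundary value problem. The sketch checks this by evaluating the left hand side of the PDE with central finite differences; the step sizes and function names are illustrative.

```python
import math


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def bs_call(x, k, r, sigma, tau):
    """Black-Scholes call price, i.e. the discounted risk neutral expectation."""
    d1 = (math.log(x / k) + (r + 0.5 * sigma ** 2) * tau) / (sigma * math.sqrt(tau))
    d2 = d1 - sigma * math.sqrt(tau)
    return x * norm_cdf(d1) - k * math.exp(-r * tau) * norm_cdf(d2)


def pde_residual(s, x, k, r, sigma, T, hs=1e-4, hx=1e-2):
    """Left hand side of the PDE for f(s, x) = bs_call(x, K, r, sigma, T - s)."""
    def f(s_, x_):
        return bs_call(x_, k, r, sigma, T - s_)

    # Central finite differences for the partial derivatives
    f_s = (f(s + hs, x) - f(s - hs, x)) / (2.0 * hs)
    f_x = (f(s, x + hx) - f(s, x - hx)) / (2.0 * hx)
    f_xx = (f(s, x + hx) - 2.0 * f(s, x) + f(s, x - hx)) / hx ** 2
    drift = r * x          # mu(s, x) under the risk neutral measure
    diffusion = sigma * x  # sigma(s, x) for GBM
    return f_s + 0.5 * diffusion ** 2 * f_xx + drift * f_x - r * f(s, x)


residual = pde_residual(0.0, 100.0, 100.0, 0.05, 0.2, 1.0)
print(residual)  # close to zero, up to finite difference error
```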

The Feynman-Kac formula can be extended to the case of $W = (W_1, W_2)$ where
$W_1, W_2$ are independent Wiener processes and we have two stochastic differentials in
$X_1$ and $X_2$. Following Shreve [Shr04], let the stochastic processes follow

$$dX_1(t) = \mu_1(t, X_1(t), X_2(t))\,dt + \sigma_{11}(t, X_1(t), X_2(t))\,dW_1 + \sigma_{12}(t, X_1(t), X_2(t))\,dW_2, \tag{53}$$
$$dX_2(t) = \mu_2(t, X_1(t), X_2(t))\,dt + \sigma_{21}(t, X_1(t), X_2(t))\,dW_1 + \sigma_{22}(t, X_1(t), X_2(t))\,dW_2, \tag{54}$$

with initial values at initial time $s$

$$X_1(s) = x_1, \tag{55}$$
$$X_2(s) = x_2, \tag{56}$$

with conditions (all coefficients and derivatives evaluated at $(t, X_1(t), X_2(t))$)

$$\int_s^T E\left[ \left( \sigma_{11} \frac{\partial f}{\partial x_1} \right)^2 + \left( \sigma_{12} \frac{\partial f}{\partial x_1} \right)^2 \right] dt < \infty,$$
$$\int_s^T E\left[ \left( \sigma_{21} \frac{\partial f}{\partial x_2} \right)^2 + \left( \sigma_{22} \frac{\partial f}{\partial x_2} \right)^2 \right] dt < \infty,$$

with boundary condition

$$f(T, x_1, x_2) = h(x_1, x_2) \tag{57}$$

and satisfy:

$$f(s, x_1, x_2) = e^{-r(T-s)} E[h(X_1(T), X_2(T)) \mid X_1(s) = x_1, X_2(s) = x_2]. \tag{58}$$

We then have the associated partial differential equation (all coefficients evaluated at
$(s, x_1, x_2)$):

$$\frac{\partial f}{\partial s} + \mu_1 \frac{\partial f}{\partial x_1} + \mu_2 \frac{\partial f}{\partial x_2}
+ \frac{1}{2}(\sigma_{11}^2 + \sigma_{12}^2) \frac{\partial^2 f}{\partial x_1^2}
+ (\sigma_{11}\sigma_{21} + \sigma_{12}\sigma_{22}) \frac{\partial^2 f}{\partial x_1 \partial x_2}
+ \frac{1}{2}(\sigma_{21}^2 + \sigma_{22}^2) \frac{\partial^2 f}{\partial x_2^2} - rf(s, x_1, x_2) = 0.$$



3. An Introduction to Volatility 

3.1. Different Types of Volatility

Wilmott in [Wil01] distinguishes between four different types of volatility, although
in practice little distinction is made between them. The four types of volatility are:

• actual volatility: this is the volatility $\sigma$ in equation (2). It is a measure of randomness
in asset return and since it exists at each moment in time it has no "timescale" associated
with it. For example volatility can be 20%, 5%, 60%;

• historic volatility (also known as realised volatility): this is a measure of volatility
using past empirical price data and will be explained further in section 3.2;

• implied volatility: this is the volatility associated with empirical option prices
and will be explained further in section 3.2;

• forward volatility: this is the volatility obtained from some forward instrument.

We note that all four types of volatility in theory should not differ since they all refer
to the same variable $\sigma$; however, in practice they may be different. For example some
researchers believe actual volatility and implied volatility are two separate variables and
treat them differently (see, for instance, Schonbucher's model).

3.2. Historic and Implied Volatility 

In theory volatility should not depend on the method of measurement. However, in
practice this is not the case. Volatility $\sigma$ can be empirically measured by two methods:
historic volatility and implied volatility. Historic volatility is calculated from empirical
(and therefore discrete) stock price data $X(t_0), \ldots, X(t_n)$ where $\Delta t = t_i - t_{i-1}$ denotes
the chosen sampling interval. To estimate historic volatility $\sigma$ we calculate the standard
deviation of an asset's continuously compounded return per unit time:

$$\sigma = \frac{s}{\sqrt{\Delta t}}, \tag{59}$$

where

• $s^2$ is the sample variance

$$s^2 = \frac{1}{n - 1} \sum_{i=1}^{n} (x_i - \bar{x})^2, \tag{60}$$
$$x_i = \ln\left( \frac{X(t_i)}{X(t_{i-1})} \right); \tag{61}$$

• $\bar{x}$ is the sample mean

$$\bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i. \tag{62}$$

Note that the sample variance contains $n - 1$ in the denominator, whereas the variance of
a theoretical distribution contains $n$.
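
The historic volatility estimate can be sketched in a few lines. The code below assumes the usual estimator: the sample standard deviation (with $n - 1$ in the denominator) of the log returns, scaled by $1/\sqrt{\Delta t}$ to express it per unit time. The function name `historic_volatility`, the price series and the choice of $\Delta t = 1/252$ (daily data, annualised) are hypothetical.

```python
import math


def historic_volatility(prices, dt):
    """Estimate volatility per unit time from equally spaced prices.

    prices: X(t_0), ..., X(t_n), all positive; dt: sampling interval.
    """
    if len(prices) < 3:
        raise ValueError("need at least three prices (two returns)")
    # Continuously compounded (log) returns
    returns = [math.log(b / a) for a, b in zip(prices, prices[1:])]
    n = len(returns)
    mean = sum(returns) / n
    # Sample variance with n - 1 in the denominator
    sample_var = sum((x - mean) ** 2 for x in returns) / (n - 1)
    return math.sqrt(sample_var / dt)


# Hypothetical daily closing prices, annualised with dt = 1/252
prices = [100.0, 101.0, 99.5, 100.5, 102.0, 101.2]
vol = historic_volatility(prices, 1.0 / 252.0)
print(vol)
```

Changing `dt` or the sampling frequency of `prices` generally changes the estimate, which is the time scale dependence discussed for empirical data.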
In contrast to historic volatility we obtain implied volatility values from empirical
options data. Using Black-Scholes option pricing, call options $C$ are a function of
$C(X, t, T, r, \sigma, K)$, with all the independent variables observable except $\sigma$. Since the
quoted option price $C^{obs}$ is observable, using the Black-Scholes formula we can therefore
calculate or imply the volatility that is consistent with the quoted options prices and
observed variables. We can therefore define implied volatility $I$ by:

$$C_{BS}(X, t, T, r, I, K) = C^{obs}, \tag{63}$$

where $C_{BS}$ is the option price calculated by the Black-Scholes equation (equation (21)).
Implied volatility surfaces are graphs plotting $I$ for each call option's strike $K$ and
expiration $T$. Theoretically options whose underlying is governed by GBM should have
a flat implied volatility surface, since volatility is a constant; however in practice the
implied volatility surface is not flat and $I$ varies with $K$ and $T$.
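
Since the Black-Scholes call price is increasing in volatility, the implied volatility can be found by a simple root search. The sketch below uses bisection, which is one possible choice and not a method prescribed by the text; the function names, the search bracket and the tolerance are illustrative assumptions, and `tau` stands for $T - t$.

```python
import math


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def bs_call(x, k, r, sigma, tau):
    """Black-Scholes European call price."""
    d1 = (math.log(x / k) + (r + 0.5 * sigma ** 2) * tau) / (sigma * math.sqrt(tau))
    d2 = d1 - sigma * math.sqrt(tau)
    return x * norm_cdf(d1) - k * math.exp(-r * tau) * norm_cdf(d2)


def implied_vol(c_obs, x, k, r, tau, lo=1e-6, hi=5.0, tol=1e-10):
    """Find I such that bs_call(x, k, r, I, tau) matches c_obs, by bisection."""
    if not bs_call(x, k, r, lo, tau) <= c_obs <= bs_call(x, k, r, hi, tau):
        raise ValueError("observed price is outside the attainable range")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        # The call price increases with volatility, so move the bracket accordingly
        if bs_call(x, k, r, mid, tau) < c_obs:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Round trip on a hypothetical quote generated with sigma = 20%
c_obs = bs_call(100.0, 100.0, 0.05, 0.2, 1.0)
iv = implied_vol(c_obs, 100.0, 100.0, 0.05, 1.0)
print(iv)  # recovers a value close to 0.2
```

Repeating the calculation across quoted strikes and expirations gives the points of an implied volatility surface.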

Implied volatility plotted against strike prices from empirical data tends to vary in a
"u-shaped" relationship, known as the volatility smile, with the lowest value normally
at $X = K$ (called "at the money" options). The opposite graph shape to a volatility
smile is known as a volatility frown due to its shape. The smile curve has become a
prominent feature since the 1987 October crash (see for instance [Bat00] and [CP98]).

Various explanations have been proposed to account for volatility smiles. Firstly, it
has been suggested that options are priced containing information about future short
term volatility that is not already contained within past price information (see for
instance [Jor95]). Secondly, implied volatility is influenced more by market sentiment
than by pure fundamentals; for instance the VIX (a weighted average of implied
volatilities from S&P 500 index options) is used as a gauge of market sentiment [SW01].

Thirdly, the transaction costs involved in trading options are significantly higher
and more complicated compared to their associated underlyings, creating volatility
smiles (see for instance [PRS99]). Finally, Jarrow and Turnbull [JT96] discuss non-simultaneous
price observation; since option and stock prices are from two different
financial markets, there will always be observation time differences. Such time differences
can cause substantial differences in the estimated implied volatility.

3.3. Empirical Characteristics of Volatility 

Volatility has been observed to exhibit consistently some empirical characteristics,
which we will now discuss. Firstly, one of the most well known empirical characteristics
of volatility is the leverage effect, as first proposed by Black [Bla76]. Black observed
that volatility is negatively correlated with stock price and accounted for this through
the concept of leverage. Leverage (also known as the gearing or debt to equity ratio)
is defined by:

$$\text{Leverage} = \frac{L}{MKT},$$

where $L$ is the company's total debt and $MKT$ is the market capitalisation (number of
shares × share price). As the share price drops the company becomes riskier, since a
greater percentage of the company is debt financed, hence increasing volatility.

Black argued leverage could not entirely explain volatility since companies with
little or no debt still exhibit high volatility. Other explanations of volatility's negative
correlation with stock price include portfolio rebalancing, where investors are forced to
liquidate their assets if they fall below a threshold price. Alternatively, threshold prices
may act as triggers for sales when interpreted in terms of prospect theory, as proposed
by Kahneman and Tversky [KT79]. Also there exists ownership concentration, where
an investor owns a substantial percentage of a company's stock and sells all his holdings
at once (see for instance [CCW08]). In both cases a large volume of selling pushes prices
down further, increasing volatility.

Secondly, volatility (and return distributions) show dependency on the time scale
$\Delta t$ chosen to measure it, as defined in equation (59). The return distribution becomes
increasingly more Gaussian as $\Delta t$ increases [Con01], known as "Gaussian aggregation",
yet under GBM volatility is theoretically scale invariant. Such empirical observations
have motivated researchers to seek models that exhibit scale variation, which is one of the
benefits of mean reverting stochastic volatility models.

Thirdly, Mandelbrot [Man63] and Fama [Fam65] were the first to observe that volatility
clusters (positively autocorrelates) with time; large (small) price changes tend to follow
large (small) price changes. Such observations motivated GARCH and stochastic
volatility models (see section 4.3) and for this reason volatility clustering is sometimes
known as the "GARCH effect" [Con04]. The autocorrelation is significant over time
scales $\Delta t$ of days and weeks but insignificant over longer time scales. This is because
the autocorrelation strength decays following a power law with $\Delta t$ increasing.

The autocorrelation's slow power law of decay has been cited as evidence of the
existence of long memory in volatility [Con01] and empirical evidence can be found
in [LGS99]. A stationary stochastic process $X(t)$ has long memory if its covariance
$v(\tau) = \mathrm{cov}(X(t), X(t + \tau))$ follows the power law of decay [Tan06]

$$v(\tau) \sim \tau^{d}, \quad \text{for } -1 < d < 0, \tag{64}$$

and

$$\sum_{\tau} v(\tau) = \infty. \tag{65}$$

Fourthly, volatility tends to be correlated with significant financial or economic
variables. For instance Schwert [Sch89b] empirically shows that volatility
increases during financial crises; during the Great Depression (1929-39) stock volatility
increased to two or three times its typical values.

Finally, in commodities and other asset classes seasonal effects and calendar time
play a significant role in return and volatility distributions. For instance, French [Fre80]
noticed that stock returns measured over the weekend are on average more negative than
on any other day of the week. In the past decade the increasing availability of intraday
data has enabled researchers to show that the time of day impacts volatility, therefore
complicating intraday volatility estimation [GSW01].

Other volatility empirical characteristics include mean reversion, where volatility
tends to revert around some long term value [SS91]. This has motivated mean reverting
volatility models (see section 4.3 for more detail). Volatility tends to be correlated
with high trading volume [SX03] and company specific news (e.g. earnings announcements)
[Ros96]. Finally Shiller claims market volatility can be explained in terms of
psychological factors such as fashion, herding effects and overreaction to new information
[Shi89]. Shiller in [Shi89] also proposes a stock price model taking into account
psychological factors.



4. A Review of Volatility and Option Pricing Models 



As described in section 2.1.3, Bachelier [Bac00] proposed a model for stock prices
with constant volatility (see equation (8)). Bachelier reasoned that investing was
theoretically a "fair game" in the sense that statistically one could neither profit nor lose
from it. Hence Bachelier included the Wiener process to incorporate the random
nature of stock prices. Osborne [Osb59] conducted empirical work supporting Bachelier's
model. Samuelson [Sam65] continued the constant volatility model under the Geometric
Brownian Motion stock price model (see equation (10)) on the basis of economic
justifications.

Over time, empirical data and theoretical arguments found constant volatility to
be inconsistent with the observed market behaviour (such as the leverage effect, as
discussed in section 3.3). A plot of the empirical daily volatility of the S&P 500 index
clearly shows volatility is far from constant. This consequently led to the development
of dynamic volatility modelling. Volatility modelling may be classified into four
categories:

1. Constant volatility $\sigma$;

2. Time dependent volatility $\sigma(t)$;

3. Local volatility: volatility dependent on the stock price $\sigma(X(t))$;

4. Stochastic volatility: volatility driven by an additional random process $\sigma(\omega)$.

These models will now be discussed in some detail in the subsequent sections. 

It is worth mentioning that some models do not use Wiener processes to capture
price movements. For example, Madan and Seneta [MS90] propose a Variance-Gamma
model where price movements are purely governed by a discontinuous jump process.
Carr et al. discuss various jump processes in [CGMY02] such as the pure jump process
of Cox and Ross [CR76]. The advantages of pure jump models are: firstly, they
realistically reflect the fact that trading occurs discontinuously; secondly, price jumps are
consistent with information releases according to Fama's "Efficient Market Hypothesis"
[Fam65]; and finally, stock price processes jump even on an intraday time scale [CT04].

4.1. Time Dependent Volatility Model 

As mentioned in section 3.3, it was empirically observed that implied volatility
varied with an option's expiration date. Consequently, a straightforward extension
proposed to the constant volatility model was time dependent volatility modelling
[Wil98]:

$$dX/X = \mu\,dt + \sigma(t)\,dW. \tag{66}$$



Merton [Mer73] was the first to propose a formula for pricing options under time
dependent volatility. The option price associated with $X$ is still calculated by the
standard Black-Scholes formula (equation (21)) except we set $\sigma = \sigma_c$ where:

$$\sigma_c = \sqrt{\frac{1}{T-t}\int_t^T \sigma^2(\tau)\,d\tau}, \tag{67}$$



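i.e. $d_1$ and $d_2$ in the Black-Scholes equation become:

$$d_1 = \frac{\log\left(\frac{X}{K}\right) + r(T-t) + \frac{1}{2}\int_t^T \sigma^2(\tau)\,d\tau}{\sqrt{\int_t^T \sigma^2(\tau)\,d\tau}}, \tag{68}$$

$$d_2 = \frac{\log\left(\frac{X}{K}\right) + r(T-t) - \frac{1}{2}\int_t^T \sigma^2(\tau)\,d\tau}{\sqrt{\int_t^T \sigma^2(\tau)\,d\tau}}. \tag{69}$$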



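Equation (67) converts $\sigma(t)$ to its constant volatility equivalent $\sigma_c$ over the time period
$t$ to $T$. The distribution of $X(t)$ is given by:

$$\log\left(\frac{X(T)}{X(t)}\right) \sim \mathcal{N}\left(\left(\mu - \tfrac{1}{2}\sigma_c^2\right)(T-t),\ \sigma_c^2\,(T-t)\right). \tag{70}$$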



Note that the constant volatility $\sigma_c$ changes in value as $t$ and $T$ change. This property
enables time dependent volatility to account for empirically observed implied volatilities
increasing with time (for a given strike).
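The conversion in equation (67) can be sketched numerically. The snippet below is an illustration only, not a method taken from the references: the function names, the midpoint integration rule and the example volatility function are assumptions made for this sketch.

```python
import math


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def black_scholes_call(X, K, r, sigma, tau):
    """Standard Black-Scholes European call price with time to expiry tau."""
    d1 = (math.log(X / K) + (r + 0.5 * sigma ** 2) * tau) / (sigma * math.sqrt(tau))
    d2 = d1 - sigma * math.sqrt(tau)
    return X * norm_cdf(d1) - K * math.exp(-r * tau) * norm_cdf(d2)


def equivalent_constant_vol(sigma_fn, t, T, steps=10_000):
    """Equation (67): root mean square of sigma over [t, T] (midpoint rule)."""
    h = (T - t) / steps
    integral = sum(sigma_fn(t + (i + 0.5) * h) ** 2 for i in range(steps)) * h
    return math.sqrt(integral / (T - t))


def time_dependent_call(X, K, r, sigma_fn, t, T):
    """Black-Scholes call price with sigma replaced by sigma_c from equation (67)."""
    sigma_c = equivalent_constant_vol(sigma_fn, t, T)
    return black_scholes_call(X, K, r, sigma_c, T - t)


# Hypothetical volatility function rising linearly from 10% to 20% over one year.
rising_vol = lambda s: 0.1 + 0.1 * s
print(equivalent_constant_vol(rising_vol, 0.0, 1.0))
print(time_dependent_call(100.0, 100.0, 0.05, rising_vol, 0.0, 1.0))
```

Because $\sigma_c$ depends on both $t$ and $T$, repeating the calculation for different expiries gives a different equivalent volatility for each expiry.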

4.2. Local Volatility Models

4.2.1. Definition and Characteristics

In explaining the empirical characteristics of volatility, a time dependent volatility
model was found to be insufficient. For instance, time dependent volatility did not
explain the volatility smile, nor the leverage effect (see section 3.3), since volatility
cannot vary with price. Therefore volatility as a function of price (and optionally
time) was proposed, that is local volatility is $\sigma = f(X,t)$:

$$dX/X = \mu\,dt + \sigma(X,t)\,dW. \tag{71}$$

The term "local" arises from knowing volatility with certainty "locally", that is when $X$ is
known; for a stochastic volatility model we never know the volatility with certainty.

The advantages of local volatility models are that firstly, no additional (or
untradable) source of randomness is introduced into the model. Hence the models are
complete, unlike stochastic volatility models, and it is theoretically possible to perfectly
hedge contingent claims. Secondly, local volatility models can also be calibrated to
perfectly fit empirically observed implied volatility surfaces, enabling consistent pricing of
the derivatives (an example is given in [Dup97]). Thirdly, the local volatility model is
able to account for a greater degree of empirical observations and theoretical arguments
on volatility than time dependent volatility (for instance the leverage effect). We will
now look at some common local volatility models.

4.2.2. Constant Elasticity of Variance Model (CEV)

CEV was proposed by Cox and Ross [CR76]:

$$dX/X = \mu\,dt + \sigma(X)\,dW, \tag{72}$$

$$\sigma(X) = aX^{n-1}, \tag{73}$$

where $0 \le n \le 1$, $a > 0$ are constants. For $n = 0$ we obtain Bachelier's model
(equation (8)) while $n = 1$ gives the Geometric Brownian Motion model (equation
(10)). Therefore $n$ can be seen as a parameter for intuitively choosing between the two
extreme models. Additionally, $n$ captures the level of leverage effect since for $n < 1$ volatility
increases as stock price decreases, as can be seen from equation (73).
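The leverage effect implied by equation (73) can be seen by evaluating the CEV volatility at a few price levels. This is a small sketch only; the function name and the parameter values $a = 2$, $n = 0.5$ are hypothetical choices for illustration.

```python
def cev_volatility(X, a, n):
    """Equation (73): CEV local volatility a * X**(n - 1)."""
    if X <= 0 or a <= 0 or not 0.0 <= n <= 1.0:
        raise ValueError("requires X > 0, a > 0 and 0 <= n <= 1")
    return a * X ** (n - 1.0)


# Hypothetical parameter values, chosen only for illustration.
for price in (50.0, 100.0, 200.0):
    print(price, cev_volatility(price, a=2.0, n=0.5))
```

With $n = 1$ the function returns the constant $a$ at every price level, which is the Geometric Brownian Motion case.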

The CEV model is analytically tractable, in contrast to the implied local volatility
model in section 4.2.4, where derivative prices are analytically intractable and must be
computed numerically WlOSj. Additionally, by appropriate choices of $a$ and $n$ we can fit CEV to
volatility smiles (see for instance Beckers [Bec80]).

The CEV model has been developed over the years, giving various modified CEV
models. One example is the square root CEV model [CR76], which is similar to the
CIR interest rate model [CIJR85]. Schroder [Sch89a] re-expresses the CEV model in
terms of a chi-squared distribution, enabling derivation of closed form solutions. Hsu
et al. HLLOSl determine the CEV model's probability density function while Lo et al.
[LYH00] derive the option pricing formula for CEV with time dependent parameters.

4.2.3. Mixture Distribution Models

Mixture distributions have been applied to numerous statistical applications for
many years outside financial mathematics (Everitt [EH81] gives a survey of such
applications). It had been known for some time that one could capture many empirical
characteristics of stock prices such as volatility smiles through mixture distributions
(see for instance Bingham and Kiesel [BK02] and Melick and Thomas [MT97]). However
Brigo and Mercurio [BM00] were the first to prove the theoretical relation between
volatility and a mixture of lognormal distributions.

Let us assume the risk neutral probability density function $p(t,X)$ at time $t$ of a
stock price $X(t)$ is a weighted sum of $N$ lognormal probability densities $p_i(t,X)$:

$$p(t,X) = \sum_{i=1}^{N} w_i(t)\,p_i(t,X), \tag{74}$$

$$\text{where } \sum_{i=1}^{N} w_i(t) = 1, \quad w_i(t) \geq 0, \ \forall t. \tag{75}$$



The term $w_i(t)$ is a weighting for each component $i$ at a point in time $t$, and $p_i(t,X)$ denotes
the probability density of component $i$ at a specific point in time for a specific stock
price $X(t)$. We assume each $p_i(t,X)$ has the same mean $\mu$ but different variances $\sigma_i^2(t)$.
Brigo and Mercurio then proved that the stock price follows the process:

$$dX/X = \mu\,dt + \sigma(X,t)\,dW, \tag{76}$$

$$\text{where } \sigma^2(X,t) = \sum_{i=1}^{N} w_i(t)\,\sigma_i^2(t). \tag{77}$$
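The mixture density of equations (74) and (75) can be sketched as follows. This is an illustrative sketch, not Brigo and Mercurio's implementation: it assumes constant weights and constant component volatilities, and the function names and parameter values are hypothetical.

```python
import math


def lognormal_pdf(x, x0, mu, sigma, t):
    """Density of X(t) under GBM started at x0 with drift mu and volatility sigma."""
    m = math.log(x0) + (mu - 0.5 * sigma ** 2) * t
    s = sigma * math.sqrt(t)
    return math.exp(-0.5 * ((math.log(x) - m) / s) ** 2) / (x * s * math.sqrt(2.0 * math.pi))


def mixture_pdf(x, x0, mu, sigmas, weights, t):
    """Equation (74): weighted sum of lognormal densities sharing the same drift mu."""
    if abs(sum(weights) - 1.0) > 1e-12 or min(weights) < 0.0:
        raise ValueError("weights must be non-negative and sum to one")  # equation (75)
    return sum(w * lognormal_pdf(x, x0, mu, s, t) for w, s in zip(weights, sigmas))


# Hypothetical two-component mixture: a low and a high volatility regime.
print(mixture_pdf(100.0, 100.0, 0.05, [0.1, 0.3], [0.4, 0.6], 1.0))
```

Since every component shares the drift $\mu$, the mixture keeps the mean $X(0)e^{\mu t}$ while its tails are fatter than those of a single lognormal density.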



The mixture distribution model has been developed further by Brigo et al. (see
for instance [BMQa] , |bMSQ3J , [BMRSn^ ). Alexander has also developed the mixture
distribution model, applying it to the areas of stress testing portfolios
ASOSj, combining it with GARCH processes [AL06] and bivariate option pricing
[AS04]. GARCH processes will be covered in more detail in section 4.4.



The mixture distribution models have been used to model more complicated volatility
models due to their ability to capture a variety of distributions. For example, Leisen
LeiOSi shows how a mixture distribution model approximates Merton's jump diffusion
model (to be covered in section 4.4) amongst others, and Lewis [Lew02] shows how mixture
models can approximate stochastic volatility models.

4.2.4. Implied Local Volatility



Dupire [Dup94] and Derman and Kani [DK94] proved for local volatility that a unique
risk-neutral process existed that was consistent with option data. The price of a
European call option by risk neutral valuation is given by:

$$C = e^{-rT}E^{Q}[X(T)-K]^{+} \tag{78}$$

$$\phantom{C} = e^{-rT}\int_{K}^{\infty} (X(T) - K)\,p(X(T))\,dX, \tag{79}$$



where $p(X(T))$ is the risk neutral probability density function for $X(T)$. Breeden and
Litzenberger [BL78] then showed from equation (79) that the risk neutral cumulative
distribution function $F(\cdot)$ at $K$ is:

$$\frac{\partial C}{\partial K} = -e^{-rT}F(X(T) > K). \tag{80}$$

Furthermore, the risk neutral probability density function $p(X(T)=K)$ is

$$\frac{\partial^2 C}{\partial K^2} = e^{-rT}p(X(T) = K). \tag{81}$$



Hence we can recover the risk neutral density $p(X(T))$ from option data. This probability
can be interpreted as the current view of the future outcome of the stock price.
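The Breeden and Litzenberger relation can be illustrated with a finite difference in the strike. The sketch below is an assumption-laden illustration: it uses Black-Scholes prices as a stand-in for market quotes, and the function names and step size `h` are hypothetical.

```python
import math


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def black_scholes_call(X, K, r, sigma, tau):
    """Standard Black-Scholes European call price, used here as synthetic option data."""
    d1 = (math.log(X / K) + (r + 0.5 * sigma ** 2) * tau) / (sigma * math.sqrt(tau))
    d2 = d1 - sigma * math.sqrt(tau)
    return X * norm_cdf(d1) - K * math.exp(-r * tau) * norm_cdf(d2)


def implied_density(call_price, K, r, T, h=0.05):
    """Risk neutral density at K: exp(rT) times the second strike derivative of C."""
    second = (call_price(K + h) - 2.0 * call_price(K) + call_price(K - h)) / h ** 2
    return math.exp(r * T) * second


# Synthetic call prices as a function of strike (spot 100, r = 5%, sigma = 20%, T = 1).
price = lambda K: black_scholes_call(100.0, K, 0.05, 0.2, 1.0)
print(implied_density(price, 100.0, 0.05, 1.0))
```

With Black-Scholes inputs the recovered density coincides with the lognormal density of $X(T)$; with real quotes the strike grid is discrete, so prices would first have to be interpolated.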

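Dupire [Dup94] then showed that by applying $p(X(T))$ from equation (81) (obtained
from Breeden and Litzenberger's work) to the Fokker-Planck equation, one could obtain
Dupire's equation:

$$\frac{\partial C}{\partial T} = \frac{1}{2}\sigma^2(K,T)K^2\frac{\partial^2 C}{\partial K^2} - (r-D)K\frac{\partial C}{\partial K} - DC, \tag{82}$$

where $D$ is the dividend. Rearranging equation (82) gives the local volatility at stock
price level $X = K$ and time $T$:

$$\sigma(K,T) = \sqrt{\frac{\dfrac{\partial C}{\partial T} + (r-D)K\dfrac{\partial C}{\partial K} + DC}{\dfrac{K^2}{2}\dfrac{\partial^2 C}{\partial K^2}}}. \tag{83}$$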
Therefore the local volatility $\sigma(X,T)$ can be fully extracted from option data.
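As a sanity check on this extraction, the standard form of Dupire's formula can be applied to a call price surface by finite differences. The following is a sketch under stated assumptions: the surface is generated from Black-Scholes with a flat 20% volatility and no dividends, so the recovered local volatility should be close to 20%; the function names and step sizes are hypothetical.

```python
import math


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def black_scholes_call(X, K, r, sigma, tau):
    """Standard Black-Scholes European call price (no dividends)."""
    d1 = (math.log(X / K) + (r + 0.5 * sigma ** 2) * tau) / (sigma * math.sqrt(tau))
    d2 = d1 - sigma * math.sqrt(tau)
    return X * norm_cdf(d1) - K * math.exp(-r * tau) * norm_cdf(d2)


def dupire_local_vol(call_price, K, T, r, D=0.0, dK=0.05, dT=1e-3):
    """Local volatility from a surface call_price(K, T) using central differences."""
    c = call_price(K, T)
    dC_dT = (call_price(K, T + dT) - call_price(K, T - dT)) / (2.0 * dT)
    dC_dK = (call_price(K + dK, T) - call_price(K - dK, T)) / (2.0 * dK)
    d2C_dK2 = (call_price(K + dK, T) - 2.0 * c + call_price(K - dK, T)) / dK ** 2
    numerator = dC_dT + (r - D) * K * dC_dK + D * c
    denominator = 0.5 * K ** 2 * d2C_dK2
    return math.sqrt(numerator / denominator)


# Synthetic surface: spot 100, r = 5%, flat volatility 20% (assumed for illustration).
surface = lambda K, T: black_scholes_call(100.0, K, 0.05, 0.2, T)
print(dupire_local_vol(surface, 110.0, 1.0, 0.05))
```

The division by the second strike derivative is what makes the calculation sensitive to noise in real option quotes.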

It can be seen from equation (83) that calculating $\sigma$ requires partial derivatives
with respect to $T$ and $K$. We therefore require a continuous set of options data for
all $K$ and $T$. This is highly unrealistic and quoted option prices tend to suffer from
significant illiquidity effects, affecting option bid-ask spreads [Nor03]. Furthermore,
Finder [Fin03] shows that option bid-ask spreads are related to volatility, expiry and
trading volume. Since option data is discrete we require some interpolation method
to convert it to continuous data, for example Monteiro et al. MTVOSl apply a cubic
spline method. However, Wilmott [Wil00] states that local volatility computation is
highly sensitive to interpolation methods.

Dupire's model implicitly assumes the options data contains all the information on
the underlying's volatility if we calibrate to options data alone. However there is
evidence to show historic and implied volatility differ significantly [CF98]. Furthermore,
calibrating the local volatility surface to the options data tends to be unstable with
time, since the surface significantly changes from one week to another [DFW98].



Numerical computation of local volatility has been implemented by Andersen and
Brotherton-Ratcliffe using a finite difference method [ABR98]. Derman and Kani
([DKC96], [DKZ96]) and Rubinstein [Rub94] determine local volatilities by fitting a
unique binomial tree to the observed option prices. Tree fitting also has the
computational advantage of not being affected by different interpolation methods.

4.3. Stochastic Volatility Models
4.3.1. Definition and Characteristics

Although local volatility models were an improvement on time dependent volatil- 
ity, they possessed certain undesirable properties. For example, volatility is perfectly 
correlated (positively or negatively) with stock price yet empirical observations suggest 
no perfect correlation exists. Stock prices empirically exhibit volatility clustering but 
under local volatility this does not necessarily occur. Consequently after local volatility 
development, models were proposed that allowed volatility to be governed by its own 
stochastic process. We now define stochastic volatility. 

Definition 11. Assume $X$ follows the stochastic differential equation

$$dX/X = \mu\,dt + \sigma(\omega)\,dW_1.$$

Volatility is stochastic if $\sigma(\omega)$ is governed by a stochastic process that is driven by
another (but possibly correlated) random process, typically another Wiener process $dW_2$.
The probability space $(\Omega, \mathcal{F}, \mathbb{P})$ is $\Omega = C([0,\infty) : \mathbb{R}^2)$, with filtration $\{\mathcal{F}_t\}_{t \geq 0}$
representing information on two Wiener processes $(W_1, W_2)$.

The process governing $\sigma(\omega)$ must always be positive for all values since volatility can
only be positive. The Wiener processes have instantaneous correlation $\rho \in [-1,1]$
defined by:

$$\mathrm{corr}(dW_1(t), dW_2(t)) = \rho\,dt.$$

Empirically $\rho$ tends to be negative in equity markets due to the leverage effect (see
section 3.3) but close to 0 in the currency markets. Although $\rho$ can be a function of
time we assume it is a constant throughout this paper.

The key difference between local and stochastic volatility is that local volatility is
not driven by a random process of its own; there exists only one source of randomness
($dW_1$). In stochastic volatility models, volatility has its own source of randomness
($dW_2$) making volatility intrinsically stochastic. We can therefore never definitely
determine the volatility's value, unlike in local volatility.

The key advantages of stochastic volatility models are that they capture a richer
set of empirical characteristics compared to other volatility models [MR05]. Firstly,
stochastic volatility models generate return distributions similar to what is empirically
observed. For example, the return distribution has a fatter left tail and peakedness
compared to normal distributions, with tail asymmetry controlled by $\rho$ [Dur07].
Secondly, Renault and Touzi [RT96] proved that volatility that is stochastic with $\rho=0$ always
produces implied volatilities that smile (note that volatility smiles do not necessarily
imply volatility is stochastic).

Thirdly, historic volatility shows significantly higher variability than would be
expected from local or time dependent volatility, which could be better explained by
a stochastic process. A particular case in point is the dramatic change in volatility
during the October 1987 crash (Schwert [Sch90] gives an empirical study on this).
Finally, stochastic volatility accounts for the volatility's empirical dependence on the
time scale measured (as discussed in section 3.3), which should not occur under local
or time dependent volatility.

The disadvantages of stochastic volatility are firstly that these models introduce 
a non-tradable source of randomness, hence the market is no longer complete and 
we can no longer uniquely price options or perfectly hedge. Therefore the practical 
applications of stochastic volatility are limited. Secondly stochastic volatility models 
tend to be analytically less tractable. In fact, it is common for stochastic volatility 
models to have no closed form solutions for option prices. Consequently option prices 
can only be calculated by simulation (for example Scott's model in section 4.3.3).

4.3.2. Stochastic Volatility and the Driving Process

Stochastic volatility models fundamentally differ according to their driving
mechanisms for their volatility process. Different driving mechanisms may be favoured due
to their tractability, theoretical or empirical appeal and we can categorise stochastic
volatility models according to them. Many stochastic volatility models favour a mean
reverting driving process. A mean reverting stochastic volatility process is of the form
[FPS00a]:

$$\sigma = f(Y), \tag{85}$$

$$dY = \alpha(m - Y)\,dt + \beta\,dW_2, \tag{86}$$

where:

• $\beta > 0$ and $\beta$ is a constant;

• $m$ is the long run mean value of $Y$;

• $\alpha$ is the rate of mean reversion.

Mean reversion is the tendency for a process to revert around its long run mean value.
We can economically account for the existence of mean reversion through the cobweb
theorem, which claims prices mean revert due to lags in supply and demand
[LH92]. The inclusion of mean reversion ($\alpha$) within volatility is important; in particular,
it controls the degree of volatility clustering (burstiness) if all other parameters are
unchanged. Volatility clustering is an important empirical characteristic of many
economic or financial time series [Eng82], which neither local nor time dependent volatility
models necessarily capture. Additionally, $1/\alpha$ can be thought of as the time
required to decorrelate or "forget" its previous value.

Equation (86) is an Ornstein-Uhlenbeck process in $Y$ with known solution:

$$Y(t) = m + (Y(0) - m)e^{-\alpha t} + \beta\int_0^t e^{-\alpha(t-s)}\,dW_2(s), \tag{87}$$

where $Y(t)$ has the distribution

$$Y(t) \sim \mathcal{N}\left(m + (Y(0) - m)e^{-\alpha t},\ \frac{\beta^2}{2\alpha}\left(1 - e^{-2\alpha t}\right)\right). \tag{88}$$

Note that alternative processes to equation (86) could have been proposed to define
volatility as a mean reverting stochastic volatility model, for example the Feller or
Cox-Ingersoll-Ross (CIR) process [FPS00b]:

$$dY = \alpha(m - Y)\,dt + \beta\sqrt{Y}\,dW_2. \tag{89}$$

However with $\sigma = f(Y)$ in equation (85) we can represent a broad range of mean
reverting stochastic volatility models in terms of a function of $Y$.
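The Gaussian distribution in equation (88) also gives an exact way to simulate the Ornstein-Uhlenbeck driving process on a time grid. The sketch below is illustrative; the function names and the parameter values are hypothetical, and the choice $f(Y) = Y$ or $f(Y) = e^{Y}$ is left to the modeller.

```python
import math
import random


def ou_moments(y0, m, alpha, beta, t):
    """Mean and variance of Y(t) given Y(0) = y0, following equation (88)."""
    mean = m + (y0 - m) * math.exp(-alpha * t)
    var = beta ** 2 / (2.0 * alpha) * (1.0 - math.exp(-2.0 * alpha * t))
    return mean, var


def simulate_ou(y0, m, alpha, beta, t, n_steps, rng):
    """One path of Y on [0, t], sampled with the exact Gaussian transition per step."""
    dt = t / n_steps
    path = [y0]
    for _ in range(n_steps):
        step_mean, step_var = ou_moments(path[-1], m, alpha, beta, dt)
        path.append(rng.gauss(step_mean, math.sqrt(step_var)))
    return path


# Hypothetical parameters: start above the long run mean and revert towards it.
rng = random.Random(42)
path = simulate_ou(y0=0.3, m=0.2, alpha=2.0, beta=0.1, t=1.0, n_steps=10, rng=rng)
print(path[-1], ou_moments(0.3, 0.2, 2.0, 0.1, 1.0))
```

A larger $\alpha$ pulls the mean towards $m$ faster and lowers the stationary variance $\beta^2 / (2\alpha)$.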
4.3.3. Significant Stochastic Volatility Models

There is no generally accepted canonical stochastic volatility model and a large 
number of them exist, therefore we review here the most significant ones. 

1. Johnson and Shanno Model

Johnson and Shanno in 1987 [JS87] proposed one of the first stochastic volatility
models:

$$dX = \mu_1 X\,dt + \sigma X^{n}\,dW_1, \quad n \geq 0, \tag{90}$$

$$d\sigma = \mu_2\sigma\,dt + \sigma^{k}\beta\,dW_2, \quad k \geq 0, \tag{91}$$

where $\mathrm{corr}(dW_1(t), dW_2(t)) = \rho\,dt$. A Monte Carlo method is proposed to
determine the price of options under the stochastic volatility process by risk neutral
valuation. Johnson and Shanno's computational results show that their option
prices are consistent with what is empirically observed (see section 3.3), that is
they exhibit a volatility smile and an increase in value with expiry.
2. Scott Model

Scott in 1987 [Sco87] considered the case where one assumes a geometric process
for stock prices and an Ornstein-Uhlenbeck process for the volatility:

$$dX/X = \mu\,dt + \sigma\,dW_1, \tag{92}$$

$$d\sigma = \alpha(m - \sigma)\,dt + \beta\,dW_2. \tag{93}$$

Scott proposed a stochastic mean reverting process based on empirical stock
price returns and describes a parameter estimation method based on moment
matching. Scott assumes $\rho=0$ to facilitate computation of option prices. The
option prices for Scott's process are calculated by Monte Carlo simulation and
he observes a marginal improvement in option pricing accuracy compared to
standard Black-Scholes option pricing.
3. Hull-White Model

Hull and White modelled volatility as follows [HW87]:

$$dX/X = \mu_1\,dt + \sigma\,dW_1, \tag{94}$$

$$d\sigma^2/\sigma^2 = \mu_2\,dt + \beta\,dW_2. \tag{95}$$

The Hull-White model is an important contribution since it provides a closed
form solution to European option prices when $\rho=0$ and for a given risk neutral
measure $\gamma$. Let us define

$$\bar{\sigma}^2 = \frac{1}{T-t}\int_t^T \sigma^2(s)\,ds, \tag{96}$$

where $\bar{\sigma}^2$ is a random variable. The option price is computed using the standard
Black-Scholes formula, under a risk neutral measure $\gamma$, with volatility $\sqrt{\bar{\sigma}^2}$. In
other words:

$$C(t, X, K, T, \sigma(\omega)) = E^{\gamma}\left[C_{BS}\left(t, X, K, T, \sqrt{\bar{\sigma}^2}\right)\right]. \tag{97}$$

The option pricing equation (97) provides results consistent with empirically
observed currency options, where empirically $\rho \approx 0$. Furthermore equation (97)
is still valid for any stochastic volatility process provided $\rho = 0$ [FPS00a]. For
correlated volatility, option prices are obtained using Monte Carlo simulation.
4. Stein and Stein Model

Stein and Stein in 1991 [SS91] proposed an Ornstein-Uhlenbeck process for
volatility based on tractability and empirical considerations:

$$dX/X = \mu\,dt + \sigma\,dW_1, \tag{98}$$

$$d\sigma = -\alpha(\sigma - m)\,dt + \beta\,dW_2, \tag{99}$$

where $\rho = 0$. Note that although equation (99) implies volatility can be negative,
Stein and Stein state that only $\sigma^2$ is ever applied in calculation [SS91]. A closed
form solution to option prices is derived for particular choices of risk neutral
measures, which is in contrast to Johnson and Shanno [JS87] and Scott [Sco87],
who only provide numerical methods for option pricing.
5. Heston Model

Heston's model [Hes93], created in 1993, stands out from other stochastic volatility
models because there exists an analytical solution for European options that takes
into account correlation between $dW_1$ and $dW_2$ (although this requires assumptions
on the risk neutral measure):

$$dX/X = \mu\,dt + \sigma\,dW_1, \tag{100}$$

$$d\sigma^2 = \alpha(m - \sigma^2)\,dt + \beta\sigma\,dW_2. \tag{101}$$

The $d\sigma^2$ modelling process originated from the CIR interest rate model [CIJR85].
To price an option under risk neutral valuation, we must specify a risk neutral
measure (due to market incompleteness) and this is chosen on economic justifications
[Hes93]. Heston then finds an analytical solution for options using Fourier
transforms; the reader is referred to Heston [Hes93] and Musiela and Rutkowski
[MR05] for a derivation.

Due to the existence of an analytic solution to option pricing the Heston model
has been subsequently developed by various researchers. For example, Scott
includes stochastic interest rates [Sco97], Pan includes stochastic dividends [Pan02]
and Bates adds jumps to the stochastic process [Bat96].
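Where no closed form is available, or simply to visualise the dynamics, the models above can be simulated. The sketch below discretises equations (100) and (101) with an Euler scheme using "full truncation" of the variance. This scheme is a common numerical choice assumed here for illustration and is not part of Heston's derivation; the function name and all parameter values are hypothetical.

```python
import math
import random


def simulate_heston(x0, v0, mu, alpha, m, beta, rho, T, n_steps, rng):
    """Euler scheme with full truncation; v denotes the variance sigma^2."""
    dt = T / n_steps
    x, v = x0, v0
    xs, vs = [x], [v]
    for _ in range(n_steps):
        z1 = rng.gauss(0.0, 1.0)
        # Build a second Gaussian with correlation rho to the first.
        z2 = rho * z1 + math.sqrt(1.0 - rho ** 2) * rng.gauss(0.0, 1.0)
        v_pos = max(v, 0.0)  # full truncation: negative variance is treated as zero
        # Log-Euler step for the price keeps X strictly positive.
        x *= math.exp((mu - 0.5 * v_pos) * dt + math.sqrt(v_pos * dt) * z1)
        v += alpha * (m - v_pos) * dt + beta * math.sqrt(v_pos * dt) * z2
        xs.append(x)
        vs.append(v)
    return xs, vs


# Hypothetical parameters with negative correlation, as observed in equity markets.
rng = random.Random(7)
prices, variances = simulate_heston(100.0, 0.04, 0.05, 2.0, 0.04, 0.3, -0.7, 1.0, 252, rng)
print(prices[-1], variances[-1])
```

Setting $\beta = 0$ with $\sigma^2(0) = m$ freezes the variance at $m$, so the scheme reduces to a Geometric Brownian Motion with constant volatility $\sqrt{m}$.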



4.4. Other Volatility Models

Apart from Bates [Bat96], the models discussed so far all have continuous sample
paths yet empirical stock prices appear to exhibit "jumps". Additionally, the frequency
and magnitude of the jumps are too large to be explained by lognormal distributions,
which would be obtained under a GBM process. Merton therefore proposed the jump
diffusion model [Mer76], which added a random jump component to the GBM process:

$$dX/X = (\mu - \theta k)\,dt + \sigma\,dW + d\mathcal{P}, \tag{102}$$

where

• $\theta$ is the average number of jumps/year;

• $k$ is the average jump size measured as a percentage of the asset price $X$;

• $\mathcal{P}$ is a Poisson process and independent of $dW$.

A Poisson process counts the number of events that occur in a given period. We define
a Poisson process $\mathcal{P}(\nu)$ as

$$p(X = k) = \frac{e^{-\nu t}(\nu t)^k}{k!}, \quad k = 0, 1, 2, \ldots \tag{103}$$

where

• $\nu$ is called the rate parameter, where $\nu$ is the expected number of events per unit
time;

• $X$ is a random variable denoting the number of events;

• $p(X=k)$ is the probability that the random number of events from 0 to time $t$
equals $k$.

In the case of the jump diffusion model the events are the jumps themselves. To obtain
a closed form solution for option prices under the jump diffusion model, Merton makes
a key argument as follows. We know that a replicating portfolio constructed in the
derivation of the original Black-Scholes equation ([BS73]) will eliminate the risk arising
from the GBM and so must earn the risk free rate of return. Next Merton assumes the
jump process represents nonsystematic risk, that is risk that is not priced into the market.
Therefore the same replicating portfolio applied in the Black-Scholes equation must
earn the risk free rate of return. Merton then proved that the option price is the sum
of an infinite series of options, valued by the standard Black-Scholes option pricing
equation without the jumps.



$$C(X(t), K, t, T, r, \sigma) = \sum_{n=0}^{\infty} \frac{e^{-\theta'\tau}(\theta'\tau)^n}{n!}\,C_{BS}(X(t), K, t, T, r_n, \sigma_n),$$

where

$$\tau = T - t,$$

$$\theta' = \theta(1 + k),$$

$$r_n = r - \theta k + \frac{n\ln(1 + k)}{\tau},$$

$$\sigma_n^2 = \sigma^2 + \frac{n\delta^2}{\tau}.$$

The $n^{th}$ option in the series is valued by the standard Black-Scholes equation, $C_{BS}$,
assuming $n$ jumps have occurred before expiry. Since the series converges exponentially
it can be implemented computationally [CT04]. The term $n\delta^2/\tau$ arises from the fact that
a Poisson process can be approximated by a normal distribution. Therefore we can
assume the logarithm of the jumps are normally distributed with variance $\delta^2$. The
jumps fatten the return distribution's tail, therefore the model is more consistent with
empirically observed distributions compared to GBM. Merton accounts for jumps as the
arrival of new information that has more than a marginal effect on price movements.
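Merton's series can be evaluated by truncating the sum. The sketch below uses the standard textbook form of the series, with Poisson weights based on $\theta' = \theta(1+k)$, and writes the variance of the log jump size as `delta ** 2`; the function names, the truncation at 60 terms and the parameter values are assumptions for illustration.

```python
import math


def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def black_scholes_call(X, K, r, sigma, tau):
    """Standard Black-Scholes European call price."""
    d1 = (math.log(X / K) + (r + 0.5 * sigma ** 2) * tau) / (sigma * math.sqrt(tau))
    d2 = d1 - sigma * math.sqrt(tau)
    return X * norm_cdf(d1) - K * math.exp(-r * tau) * norm_cdf(d2)


def merton_call(X, K, r, sigma, tau, theta, k, delta, n_terms=60):
    """Truncated Merton jump diffusion series (requires k > -1)."""
    theta_prime = theta * (1.0 + k)
    weight = math.exp(-theta_prime * tau)  # Poisson weight for n = 0
    price = 0.0
    for n in range(n_terms):
        r_n = r - theta * k + n * math.log(1.0 + k) / tau
        sigma_n = math.sqrt(sigma ** 2 + n * delta ** 2 / tau)
        price += weight * black_scholes_call(X, K, r_n, sigma_n, tau)
        weight *= theta_prime * tau / (n + 1)  # recursion to the next Poisson weight
    return price


# Hypothetical inputs: one jump per year on average, mean jump size -10%.
print(merton_call(100.0, 100.0, 0.05, 0.2, 1.0, theta=1.0, k=-0.1, delta=0.15))
```

With `theta=0` only the first term survives and the function returns the plain Black-Scholes price, which is a convenient check on an implementation.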

Another class of volatility models uses a stochastic process to model the evolution
of the implied volatility directly. In all the models discussed so far, we have
assumed implied volatility and the underlying's volatility are the same. Implied volatility
methods model an option's (or any derivative's) implied volatility $\sigma^*$ separately from
the underlying's volatility $\sigma$. One significant implied volatility model is Schonbucher's
stochastic implied volatility model [Sch99]. Schonbucher models the implied volatility
$\sigma^*$ of a vanilla option as an SDE and the underlying's SDE separately:

$$d\sigma^* = \mu_2\,dt + \sigma_2\,dW_1 + \beta\,dW_2. \tag{104}$$

The underlying's SDE is:

$$dX/X = \mu_1\,dt + \sigma_1\,dW_1. \tag{105}$$

Another class of volatility models is the lattice approach, with its origins in the
binomial trees of Cox et al. [CRR79]. Britten-Jones and Neuberger [BJN00] model
stochastic volatility by fitting a lattice model that is consistent with observed option
prices. The model is parameterised by up and down price movements and their
associated probabilities for each branch, using empirical option data as its input. This is
an improvement on the model of Derman and Kani, who fit a tree to local volatility only.
Britten-Jones and Neuberger also show how a variety of stochastic volatility models
(such as regime switching volatility) can be fitted to be consistent with observed option
price data.

Discrete time volatility models are another class of models that exist, such as
GARCH(p,q) [Bol86], the generalised autoregressive conditional heteroscedasticity model:

$$\sigma_t^2 = a_0 + \sum_{i=1}^{q} a_i\,\epsilon_{t-i}^2 + \sum_{i=1}^{p} b_i\,\sigma_{t-i}^2, \tag{106}$$

where $\epsilon_t$ denotes the return innovation at time $t$, and $a_i$ and $b_i$ are weighting constants.
We obtain the ARCH(q) [Eng82] model by setting $p=0$. Due to the success of GARCH in
econometrics it has been substantially extended by various researchers, for example
Nelson [Nel91], Sentana [Sen95] and Zakoian [Zak94]. Whereas with continuous time
modelling one is able to derive closed form solutions, which can reduce computation and
provide new insights, this is generally not possible with discrete time models.
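The GARCH(p,q) recursion is straightforward to evaluate for a given series of innovations. The sketch below assumes the standard Bollerslev form, with $q$ lagged squared innovations and $p$ lagged variances; the function name, the start-up rule (`init_var`) and the example numbers are hypothetical.

```python
def garch_variances(eps, a0, a, b, init_var):
    """Conditional variances sigma_t^2 for a list of innovations eps.

    a = [a_1, ..., a_q] weights the lagged squared innovations and
    b = [b_1, ..., b_p] weights the lagged variances (b = [] gives ARCH(q)).
    The first max(p, q) variances are set to init_var, an assumed start-up rule.
    """
    q, p = len(a), len(b)
    start = max(p, q)
    sigma2 = [init_var] * min(start, len(eps))
    for t in range(start, len(eps)):
        arch_part = sum(a[i] * eps[t - 1 - i] ** 2 for i in range(q))
        garch_part = sum(b[i] * sigma2[t - 1 - i] for i in range(p))
        sigma2.append(a0 + arch_part + garch_part)
    return sigma2


# Hypothetical GARCH(1,1): a large shock at t = 1 raises the variance at t = 2,
# after which the variance decays back gradually.
print(garch_variances([0.0, 3.0, 0.0, 0.0, 0.0], a0=0.1, a=[0.1], b=[0.8], init_var=1.0))
```

The slow decay after the shock is how the model reproduces volatility clustering in discrete time.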

Finally, one class of volatility models are "hybrid" models, which combine various
volatility models into one. For instance Alexander [Ale04] extends Brigo's and
Mercurio's mixture model [BM00] by combining it with the binomial tree of Cox et al.
[CRR79] to incorporate stochastic volatility. The SABR model [HKLW02] is a stochastic
extension of the CEV model [CR76], where SABR is an abbreviation for stochastic
alpha, beta and rho in its equations. The SABR model captures the dynamics of a
forward price $F$ of some asset (e.g. stock) under stochastic volatility; its risk neutral
dynamics under measure $Q$ are:

$$dF(t) = \sigma(t)F^{\beta}(t)\,dW_1, \tag{107}$$

$$d\sigma(t) = \alpha\,\sigma(t)\,dW_2, \tag{108}$$

$$\text{where } \mathrm{corr}(dW_1, dW_2) = \rho \in [-1, 1], \quad 0 \leq \beta \leq 1, \quad \alpha \geq 0. \tag{109}$$

A common shortfall in all the volatility models reviewed so far has been that all 
these models are short term volatility models. The models implicitly ignore any long 
term or broader economic factors influencing the volatility model, which is empirically 
unrealistic and theoretically inconsistent. Furthermore, although some models may 
specify a closed form solution for option pricing, they provide no method or recom- 
mendation for calibration, which is important to modelling and option pricing. 

5. Conclusions 

This paper has surveyed the key volatility models and developments, highlighting
the innovation associated with each new class of volatility models. In conclusion it can
be seen from our review of volatility models that their development has progressed in
a logical order to address key shortcomings of previous models.

Time dependent models addressed option prices varying with expiration dates, local 
volatility also addressed volatility smiles and the leverage effect, whereas stochastic 
volatility could incorporate all the effects captured by local volatility and a range of 
other empirical effects e.g. greater variability in observed volatility. However the trade- 
off associated with improved volatility modelling has been at the expense of analytical 
tractability. 